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Random Boolean networks were introduced in 1969 by Kauffman as a model for gene regulation. 
By combining analytical arguments and efficient numerical simulations, we evaluate the properties 
of relevant components of critical random Boolean networks independently of update scheme. As 
known from previous work, the number of relevant components grows logarithmically with network 
size. We find that in most networks all relevant nodes with more than one relevant input sit in 
the same component, while all other relevant components are simple loops. As the proportion of 
nonfrozen nodes with two relevant inputs increases, the number of relevant components decreases 
and the size and complexity of the largest complex component grows. We evaluate the probability 
distribution of different types of complex components in an ensemble of networks and confirm that 
it becomes independent of network size in the limit of large network size. In this limit, we determine 
analytically the frequencies of occurence of complex components with different topologies. 
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1. INTRODUCTION 

Random Boolean networks are often used as generic 
models for the dynamics of complex systems of interact- 
ing entities, such as social and economic networks, neural 
networks, and gene or protein interaction networks [1]. 
The simplest and most widely studied of these models 
was introduced in 1969 by Kauffman [2] as a model for 
gene regulation. The system consists of N nodes, each 
of which receives input from K randomly chosen other 
nodes. The network is updated synchronously, the state 
of a node at time step t being a Boolean function of the 
states of the K input nodes at the previous time step, 
t — 1. The Boolean update functions are randomly as- 
signed to every node in the network, and together with 
the connectivity pattern they define the realization of the 
network. For any initial condition, the network eventu- 
ally settles on a periodic attractor. Of special interest 
are critical networks, which lie at the boundary between 
a frozen phase and a chaotic phase [3-5]. In the frozen 
phase, a perturbation at one node propagates during one 
time step on an average to less than one node, and the at- 
tractor lengths remain finite in the limit N — > oo. In the 
chaotic phase, the difference between two almost identi- 
cal states increases exponentially fast, because a pertur- 
bation propagates on an average to more than one node 
during one time step [6] . 

During the last few years, great progress has been 
made in the understanding of critical networks with 
K = 2 inputs per node, see [7-15]. They contain three 
classes of nodes, which behave differently on attractors. 
First, there are nodes that are frozen on the same value 
on every attractor. Such nodes give a constant input 
to other nodes and are otherwise irrelevant. They form 
the frozen core of the network. Second, there are nodes 
without outputs and nodes whose outputs go only to ir- 
relevant nodes. Though they may fluctuate, they are 
also classified as irrelevant since they do not influence 
the number and periods of attractors. Third, there are 



relevant nodes, which belong to the nonfrozen nodes and 
lie on directed loops built of links between relevant nodes, 
or they connect such loops by chains of relevant nodes. 
These nodes determine completely the number and pe- 
riod of attractors. A connected set of relevant nodes is 
called a relevant component. The relevant subnetwork 
consists of the disjoint relevant components. The non- 
frozen nodes that are not relevant sit on trees rooted 
in the relevant components. Numerical simulations and 
analytical estimation in [8] showed that the number of 
nonfrozen nodes scales in the limit N — > oo as N 2 ^ 3 
and the number of relevant nodes as TV 1 / 3 . An ana- 
lytical study of the distribution of attractor lengths in 
the limit of big network size in [9] is shown in [12] to 
contain the same result. The corresponding probability 
distributions for the number of nonfrozen and relevant 
nodes in terms of scaling functions have been obtained 
in [13, 14] by means of combination of analytical calcu- 
lations and numerical simulations. One application of 
analytical approach developed in [15] extends the find- 
ings of [8] and gives slightly better numerical approxi- 
mations for the scaling functions. In [13], it was addi- 
tionally shown that the number of nonfrozen nodes with 
two relevant inputs scales as N 1 / 3 , and that the number 
of relevant nodes with two relevant inputs remains finite 
in the limit N — > oo. Probability distributions for these 
two quantities were also given. From these results it fol- 
lows that just a few components are complex and include 
relevant nodes with two relevant inputs, while most rele- 
vant components are simple loops whose number is pro- 
portional to In y/ N n f for large networks, N n f being the 
mean number of nonfrozen nodes. The nonfrozen part of 
a critical K = 2 network is much like a critical network 
with K = 1 inputs per node and only the two nontrivial 
Boolean functions "copy" and "invert" . All N nodes in 
those networks are nonfrozen, and the relevant nodes are 
arranged in simple loops. The mean number of relevant 
nodes is proportional to y/N [15, 16], the mean number 
of loops equals In y/N for large networks, the number of 
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loops of length I is Poisson distributed with mean 1 / 1 for 
/ "C V^V"[12], the asymptotic probability distribution for 
the number of relevant nodes is [13] 

The number and period of attractors of a network is 
given by the number and length of attractors of its rele- 
vant subnetwork. These arc obtained by combining the 
attractors of the individual relevant components. If all 
relevant components were simple loops, it would already 
follow that the number and length of attractors increases 
with system size faster than any power law [12, 17]. Com- 
plex components make the number of attractors even 
larger, since they possess themselves exponential num- 
bers and periods of attractors [18]. Therefore, complex 
components are important for the understanding of dy- 
namics of large random Boolean networks (RBNs). 

Complex components are of interest also for other rea- 
sons. As shown in [19], Boolean network models can be 
appropriate models for the dynamics of genetic regula- 
tory systems. Specialized functional blocks should play 
an important role in such models, and functional blocks 
can be modelled as complex components, when dynam- 
ics can be treated as Boolean. Since the determinis- 
tic parallel update rule used in many Boolean network 
models is not biologically realistic, other, and in partic- 
ular stochastic updating schemes have been investigated 
[11, 20]. While they affect the number of attractors, they 
do not influence the classification in frozen, nonfrozen, 
and relevant nodes. The relevant components and the 
position of critical line in the phase diagram [5, 6] of 
Boolean networks are the same for all update rules [10]. 
Components and motifs (smaller units of nodes) play an 
important role at determining whether an attractor is 
robust with respect to stochastic fluctuations [21]. 

Because of their importance for the dynamics of 
Boolean networks, we study in this paper complex rel- 
evant components of RBNs with K = 2. Future work 
will have to move on to more realistic network structures. 
In the following, we will prove in section 2 that in most 
large networks all nodes with two relevant inputs reside 
in the same complex component, and we will use this 
result to estimate analytically the number of complex 
components in a network. Furthermore, simulation re- 
sults are presented and discussed. In section 3 we classify 
complex components and discuss their probability distri- 
bution both analytically and by using numerical simula- 
tions. Finally, in section 4 we list all possible complex 
components with one and two nodes with two relevant 
inputs and calculate their frequencies of occurence. A 
summary of our findings is given in section 5. 



2. NUMBER OF RELEVANT COMPONENTS 

In this section, wc discuss the number of relevant com- 
ponents in critical networks, with a special focus on com- 



plex relevant components. We will first describe the 
stochastic procedure leading to the relevant nodes and 
the components formed by them, which is the basis of 
our computer simulations and analytical considerations. 
Then, we will argue that most networks have not more 
than one complex relevant component, and we will ob- 
tain the probability distribution for the total number of 
relevant components. 



A. Algorithm for obtaining relevant nodes and 
relevant components 

In the subsequent sections, we will present numerical 
simulations of the ensemble of relevant subnetworks in 
a class of critical RBNs with two inputs per node de- 
fined in the following way. The probability (3 to assign a 
frozen Boolean function (for which the output value does 
not depend on the input values) to a node is equal to 
the probability to choose a reversible Boolean function 
(for which the output changes whenever any input value 
changes). By 7 we denote the probability of choosing a 
canalyzing function that yields one value for three differ- 
ent input values combinations and once the other output 
value. The remaining proportion 1 — 2/3 — 7 of functions 
are those that respond only to one input. 

We will use an efficient method presented in [13], which 
was developed for calculation of scaling properties of the 
number of nonfrozen and of relevant nodes. In [13], sta- 
tistical properties of the ensemble of networks were eval- 
uated, and appropriate stochastic processes were used to 
determine the number of frozen and relevant nodes for 
a given network. The frozen nodes were determined by 
starting with nodes with frozen functions and by iter- 
ativcly determining nodes which become frozen due to 
being connected to a frozen node. The outcome was the 
probability distribution 
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with N n f being the number of nonfrozen nodes. Most of 
these nonfrozen nodes have only one relevant input, and 
the number of nonfrozen nodes with two relevant inputs 
A r 2 is distributed according to 
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with a = N2 1 \J N n f ■ Then, only nonfrozen nodes and 
their connections were considered. The idea for identifi- 
cation of relevant nodes is to first identify nodes without 
outputs as irrelevant nodes and then to determine iter- 
atively all other nodes whose outputs go only to irrele- 
vant nodes. These nodes arc also irrelevant. The actual 



3 



stochastic procedure for obtaining relevant nodes in [13] 
is more involved, see original paper for details. For the 
scaling variable 
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one gets a probability distribution P{z) the shape of 
which depends on the parameters (3 and 7. P(z) decays 
exponentially for large z. An explicit expression for P(z) 
in terms of (3) and another scaling function occuring in 
the stochastic process is given in Eq. (29) of [13]. 

Most relevant nodes have one relevant input. The full 
ensemble probability P(m; z) for (4) to be in the range 
[z,z + dz], while m of the relevant nodes have two rel- 
evant inputs, depends again on (3 and 7. An important 
observation is that P(m; z) does not depend on N in the 
limit of large network size. The mean number of relevant 
nodes with two relevant inputs is therefore finite in this 
limit. 

The described procedure from [13] provides us with 
the number of relevant nodes N re i and the number of 
relevant nodes with two relevant inputs to. Connecting 
these nodes randomly generates a network from the en- 
semble of relevant subnetworks. A finite fraction e of rel- 
evant nodes are arranged in simple loops. Simple loops 
of length I appear [13] with probability 



P{1) = I /I for I < l c with l c 
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in the limit N re i — > 00. The other (1 — e)N re i relevant 
nodes sit on complex components containing nodes with 
two relevant inputs. 



2to nodes do not form a fully connected directed graph. 
We denote this probability by G m . The strategy for cal- 
culating G m is the following. We look at all possible par- 
titionings of connected subgraphs of the 2m nodes with 
connections and sum the contributions to the probability 
coming from 1,2,3,... fully connected subgraphs. Each 
of those contributions itself includes a sum of different 
partitionings with a fixed number of subgraphs. The last 
contribution in the overall sum comes from a partition- 
ing in m connected pairs of nodes, one with two inputs 
and one output and one with two outputs and one in- 
put. If we do this, we get Gi = 0, G2 = 0.1, G3 =0.1, 
G4 w 0.08, G5 « 0.07. The general formula reads 
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with N m = (3m) ! G rn , 



where the sum is over all nontrivial partitions of to nodes 
in m groups of i nodes, to = To estimate the 

asymptotic behavior for large to, we note that the largest 
contribution for large to comes from the partitioning in 
one connected pair and a fully connected rest. We ap- 
proximate G m by the contribution from a partitioning 
in one connected pair and an arbitrarily connected rest: 
6to 2 (3(to, — 1))!/(3to)! — > 2/(9to) for large to. We con- 
clude that all nodes with two relevant inputs are usually 
gathered in one complex component, even for small val- 
ues of TO. 



C. Total number of relevant components 



B. Number of complex relevant components 

We argue in the following that most relevant nodes 
with two relevant inputs reside in one component. We 
calculate the probability that this is not the case and 
show it to be small and to decrease as 1/to for large to. 

We neglect the cases where one relevant node has two 
relevant inputs and two relevant outputs, as well as where 
a node has more than two relevant outputs. The proba- 
bility for such nodes vanishes in the limit of large network 
size. Then, the number of relevant nodes with two rele- 
vant outputs is identical to the number of relevant nodes 
with two relevant inputs to. 

In the following, we first fix the value of to. Later, 
we will perform summations over to using the probabil- 
ity distribution p(m). For a given to, the ensemble of 
relevant networks can be constructed by connecting the 
2to nodes with 2 relevant inputs or two relevant outputs 
amongst one another, each realization of connections ap- 
pearing with equal probability, and by subsequently in- 
serting the nodes with one relevant input into the existing 
connections. The probability that the to nodes are not 
in the same component is thus the probability that the 



We build the relevant components by first ignoring one 
input of each nonfrozen node that has two inputs. The 
second inputs will be connected later. We thus first build 
a K = 1 critical network from the nonfrozen nodes, many 
properties of which arc known from the literature. In par- 
ticular, all nonfrozen nodes will at this stage be arranged 
in 

G ~ l/2(ln(27V n/ ) + lE ) , (7) 

simple loops and trees rooted in loops [22]. 7^ r* 0.577 
is the Eulcr-Mascheroni constant. The width of the dis- 
tribution of the number of loops is asymptotically 

<7 2 = C-7r 2 /8. (8) 

Equations (7), (8) are valid for fixed values of N n f . If 
we insert for N n f the mean value N n f « 0.62(iV//3) 2 / 3 
(see equation (12) in [13]), we get G « 6.4 for N = 2 23 . 
If we use the full probability distribution for the num- 
ber of nonfrozen nodes and the number of simple loops, 
we obtain the following formula for the mean number of 
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simple loops: 
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which is a combination of exact results (II.C16) and 
(II.C17) for the ensemble of K = 1 networks from [22], 
and of (1). We designed a program to effectively evalu- 
ate (9) numerically. For the above-mentioned system size 
N = 2 23 we get (7=6.13. We use this value in the fol- 
lowing discussion. Note that using mean values instead 
of full probability distributions gives good estimations of 
the order of magnitude in the present context. 

Now we estimate the number of components out of the 
C simple loop components on which the relevant nodes 
with two relevant inputs and the nodes with two relevant 
outputs sit. By assuming that all of them will end up 
on the same complex component when the second inputs 
will be connected, we obtain our desired result for the 
mean number of relevant components. In the following, 
we approximate the mean value of some functions by the 
functions of the mean values, so that the results (10) and 
(11) are valid approximately 

Because of (5), the mean number of components with 
loops of lengths from logarithmically constant intervals 
will be constant. If l\ and I2 denote the lower and upper 
boundary of an interval, there is on an average one loop 
in an interval of logarithmic size I2/I1 = e. We consider 
such intervals with the upper boundaries I2 equal to c, 
c 2 , and so on. For two neighboring intervals, the factor 
between the numbers of nodes on the loops is e on an av- 
erage. Since for an interval the total number of nodes on 
the loops and on their trees is proportional to the num- 
ber of nodes on the loops, the factor between the total 
numbers of nodes in two neighboring intervals is also e. 
With this assumption, the proportion of nodes in the pth 
largest component is Ae - ^ -1 - 1 , with A = 1 — 1/e R* 0.632. 
We assumed that e~ c -C 1. The value A = 1 — 1/e is 
not far from the numerically determined value A sa 0.624 
for the proportion of nodes in the largest component in 
ensembles of K = 1 networks with any large but fixed 
number of nonfrozen nodes. Let us now turn to the m 
relevant nodes with two relevant inputs, with one input 
being ignored. The number of nodes with two relevant 
inputs or two relevant outputs in the largest nonfrozen 
component is then of the order 2mA, and the smaller 
components will contain 2mA/e, 2mA/e 2 , . . . , 1 nodes 
with two relevant inputs. For the purpose of estimation, 
we neglect further the fact that the distribution of the 
number of nodes with two relevant inputs p(m) depends 
[13] on the number of relevant nodes and, therefore, cor- 
relates with the number of nonfrozen nodes on the loops. 

The total number n(m) of nonfrozen components 
with nodes with two relevant inputs (one of them is 
cut off) is consequently obtained from the condition 



2mA c (™( m ) !) — 1 for m > 1. For m = 1, this gives 
?i(l) ~ 1.234. The exact result ra(l) = 4/3 is within 10 
percent of this estimate. To explain the exact result for 
fixed m = 1 we consider all possible relevant components 
containing the one node with two relevant inputs, which 
result after connecting the cut off input. They are repre- 
sented in figure 5. Clearly, the two relevant inputs could 
have been cut off with the same probability. As explained 
in section 4, relative frequency of occurence for the sec- 
ond component in figure 5 is 1/3, and cutting one input 
off the node with two inputs leads to one nonfrozen com- 
ponent. Such components have therefore resulted from 
one nonfrozen component. For the first component in fig- 
ure 5, which appears with relative probability 2/3, cut- 
ting one input leads to two nonfrozen components with 
probability 1/2. The mean number of K = 1 nonfrozen 
components connected by the relevant node with two rel- 
evant inputs is then 1 * 2/3 + 2 * 1/3 = 4/3. 

Taking all results together, we obtain the following es- 
timate for the mean number of nonfrozen components 
with relevant nodes with two relevant inputs: 
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p(m)n(m) w p(m) In (2m(e — 1)) . (10) 



We recall that (almost) all nodes with two relevant inputs 
are gathered in one complex component, and with (10) 
we get an estimation for the mean number of relevant 
components, which is valid for large networks: 
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C&= 2 (iV) « C(N) - ]T p(m)(n(m) - 1) 



rn—1 



C(N) - In m - (1 -j>(0))ln(2A). (11) 

We support (11) by evaluating the number of relevant 
components in numerical simulations, see figure 1. For 
comparison with (11), some results from section 3 have 
been used, sec figure 3 there. 

The average number of complex components per net- 
work can be extracted from results presented in figure 1 
as the difference between the number of relevant compo- 
nents and the number of simple loop components. For 
model parameters /3 = 1/8 and 7 = 0,1/4,1/2 we get 
0.6, 0.7 and 0.8 complex components on average respec- 
tively. For the three cases, in section 3 we show that the 
probability to have no nodes with two relevant inputs in 
a network is 0.51, 0.32 and 0.27, so that the average num- 
ber of complex components in the networks with complex 
components evaluates to 1.2, 1.0 and 1.1 respectively, jus- 
tifying the assumption that there is usually no more than 
one complex component. Statistical errors of simulation 
results consistently allow for errors in the first digit after 
the comma. 

The difference between simulation results in figure 1 
and the results from (11) is smaller for smaller values 
of 7, where the approximations used to obtain (11) arc 
intuitively better. An estimation in the spirit of (10) 
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FIG. 1: £e/T figure: Probability distribution for the number 
of relevant components per network. Results were averaged 
over 10 4 network realizations for the following model param- 
eters: P = 1/8; 7 = 0, 1/4, 1/2; TV = 2 23 ~ 8 ■ 10 6 for 7 ^ 1/2 
and TV = 2 22 for 7 = 1/2. Obtained in simulations results 
for mean numbers of components for 7 = 0, 1/4, 1/2 are 5.7, 
5.1, 4.5 respectively. Using results of simulations or analyti- 
cal results for p(m), see section 3 for details, and (11) we get 
5.5, 4.5, 3.6. The agreement between theoretical estimation 
and simulations is satisfactory. Half maximal widths of the 
simulation curves are 5.3, 5.0, 4.6 compared to values 4.43, 
4.43, 4.32, determined from (8), whereby m-dependence has 
been ignored. 

Right figure: Number of simple loop components per network 
in K — 2 critical RBNs for the same sets of model parameters 
and numerical simulation parameters as in the left figure. Our 
simulation results for the mean number of simple loop com- 
ponents in the largest considered networks for 7 = 0, 1/4, 1/2 
are 5.1, 4.4, 3.7 respectively. From the estimation (11) we 
get 5.0, 4.2, 3.3, since all nodes with two relevant inputs are 
assumed to reside in one complex component as explained in 
the main text. 



and (11), which would take into account that the prob- 
ability for relevant nodes with two relevant inputs to be 
arranged in more than one relevant complex component 
is small but greater than zero, would not change (10), but 
it would lead to slightly smaller values of C^ < e f 2 (N, (3, 7) 
in (11). This observation partially explains the differ- 
ence between the simulation and the estimation results 
in figure 1. 



3. DISTRIBUTION OF COMPLEX 
COMPONENTS 

The findings of the last section together with the re- 
sults in [13] enable us to evaluate the mean number of 
relevant nodes with two relevant inputs, the probabil- 
ity distribution for the number of relevant nodes with 
two relevant inputs, and the mean number of relevant 
complex components per network with a given number 



FIG. 2: The mean number of nodes with two relevant inputs 
for P = 1/8 as function of the network size TV. The averaging 
was performed over 10 4 network realizations. 



of nodes with two relevant inputs. The complex com- 
ponents have nontrivial dynamics and can be associated 
with functional blocks in real genetic networks. 

Let us first evaluate the number of relevant nodes with 
two inputs. Using Equations (23), (24), (29), (31) from 
[13], we can write its probability distribution as 



p(m) 



P(m; z)dz 



{ml) 2 



00 {^_/2) m P(z) 



(12) 



where I n (z) denotes the modified Bessel function of 
the first kind and satisfies the relations Iq(x) = 
TZ=,{^? ^d h{x) = 2/,E: =0 m(^)l 

The mean number of nodes with two relevant inputs is 
then finite and is given by 



z - ^ ' Jl jlp) ' p( z )dz . (13) 



The integral converges since P(z) decreases exponentially 
for large z (compare [13]), and the rest of the integrand 
grows faster than linearly but slower than quadratically 
with z. figure 2 shows results of computer simulations 
for the values of m. With increasing TV, they approach a 
constant value. 

For further convenience, we introduce the notation 
K = 1 + 7//?. The dependence of m and p(m) on the 
model parameters (3 and 7 is fully determined by the de- 
pendence of P(z) on k. Since P(z) becomes broader for 
larger k (therefore assuming smaller values at small z), 
see [13], fh increases with n. This means that increasing 
k leads to more relevant nodes with two relevant inputs. 
We will look at the dependence of p(m) on k in more 
detail later. 
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FIG. 3: Simulation results for the number c m (n m ) of com- 
ponents per network with n m nodes with two relevant inputs 
(dotted lines). Simulation results for the probability distribu- 
tion p(m) for the number m of nodes with two relevant inputs 
per network (solid lines). Theoretical prediction for the prob- 
ability distribution p(m) (dashed line). The model parame- 
ters used are = 1/8 and 7 = (circles, squares) or 7 = 1/4 
(rhombi) or 7 = 1/2 (triangles). All simulations were run 
with 8000 different network realizations and results were av- 
eraged afterwards. Network size was N ~ 10 7 (dotted lines), 
N ~ 10 5 (solid lines). Approximation P^ {z) = 0.62 e" 652 
was used to obtain the dashed curve. 



We turn to the discussion of the connection between 
p(m) and the number of components with a given num- 
ber of nodes with two relevant inputs. From section 2 we 
know that, except for a small correction, most nodes with 
two relevant inputs reside in one complex component. 
Therefore, the fraction of number of networks, where we 
find a given value of m will simultaneously be the frac- 
tion of number of networks where we find complex com- 
ponents (one complex component) with this number of 
nodes with two relevant inputs. In figure 3, the dotted 
lines represent the mean number c m of components per 
network with a given number n m of nodes with two rel- 
evant inputs, obtained in our computer simulations by 
counting the number of such components in the gener- 
ated network ensemble for different model parameters. 
To verify that the relevant nodes with two relevant inputs 
usually sit in the same component, we also evaluated the 
probability distribution p(m) numerically (solid lines in 
figure 3). For m > 1 and n m > 1, the difference between 
the solid and the corresponding dotted lines in figure 3 
is mainly due to statistical fluctuations. They become 
larger for larger networks, we therefore used compara- 
tively small networks with N ~ 10 5 to obtain the solid 
lines. In the rare network realizations with two complex 
relevant components (appearing more often with increas- 
ing m) the smaller complex component has mostly one 
relevant node with two relevant inputs (to = 1), see the 
data points for to = n m = 1 in figure 3. Finally, we 
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FIG. 4: The number of relevant complex components with m 
nodes with two relevant inputs is constant for large networks. 
In the figure, circles, diamonds and triangles correspond to 
model parameters /3 — 1/8 and 7 = 0, 1/4, 1/2 respectively. 
All simulations were run with 8000 different network realiza- 
tions. 



calculated p(m) analytically with (12). The theoretical 
dashed curve in figure 3 agrees well with the simulation 
results. We only calculated analytical results for 7 = 0. 
That the number of components with a given value of 
m is indeed independent of the system size is shown in 
figure 4. 

To complete our understanding of simulation results in 
figure 3 we point out the connection to the results from 
section 2. For m = the solid lines represent the frac- 
tion of networks without nodes with two relevant inputs. 
We used the results for these fractions 0.51, 0.32, 0.27 
for figure 1 . Network realizations without relevant nodes 
with two relevant inputs are indistinguishable from net- 
works in the model with K = 1 . In this context we want 
to mention that the fraction of "frozen" networks with- 
out relevant nodes ~ (/3/7V) 1 / 3 is negligible in the limit 
N — > 00, whereas lim z _>o P{z) 7^ 0. The dotted curves 
at n m = represent the number of simple loops per net- 
work. For 7 = 0, 0.25, 0.5 this number is 5.1, 4.4, 3.7 
respectively. These values are in good agreement with 
those calculated with (11), see figure 1. 

Let us conclude this section by discussing the depen- 
dence of p(m) on the parameter k. For fixed k, p(m) is 
a monotonously decreasing function, which is flatter for 
larger k. For fixed to, the value p(m) as function of k 
has a maximum, the position of which moves to larger k 
with increasing m. We see an indication of this in fig- 
ure 3 and figure 4, where the data points for k = 1 are 
not the highest ones when m is larger than 4. Stronger 
support comes from an analysis of Eq. (12). The into- 



7 



grand apart from P(z) has one global maximum, which 
shifts to larger values of z as m increases. The function 
P(z) is monotonically decreasing, being flatter and start- 
ing at a smaller value for larger k. From these properties 
of the two contributions to the integrand, we find imme- 
diately that the value p(m) at a given m as function of 
k has a maximum, which moves to larger k with increas- 
ing m. For very large k, the function P{z) in (12)can be 
approximated [13] by its value P[z = 0) = \Z2n/ (4k 1 / 3 ) 
for not too large m, revealing how p(m) decreases with 
increasing k. 

From these considerations it follows that complex com- 
ponents with a small number of nodes with two relevant 
inputs will appear with smaller probability if we increase 
«, while complex components with larger m ~ m occur 
more often. 



CHD 

e 

FIG. 5: Schematic representation of the two topologically dif- 
ferent relevant complex components with two nodes with two 
relevant inputs. Arcs stand for one link or a chain of nodes 
with one relevant input. Each crossing of arcs marks a node 
with two relevant inputs or outputs. Arrows depict the inputs 
of nodes with two relevant inputs, they appear only where 
they are important to distinguish the complex component. 
The first component appears in a network twice as often as 
the second one. 



4. RELATIVE FREQUENCIES OF 
TOPOLOGICALLY DIFFERENT COMPONENTS 

So far, we characterized relevant complex components 
by the number of nodes with two relevant inputs found in 
them. We determined, which complex components and 
with what probability are found in a network asymptot- 
ically. Now, we want to consider the topology of such 
complex components. 

We consider two complex components as different if 
they possess a different topology. Components with the 
same topology but with different numbers of nodes in the 
different chains of relevant nodes with one relevant input, 
have similar dynamics, their main difference being differ- 
ent time delays along chains of nodes. From our study of 
simple examples of complex components in [18], we know 
that changing the length of these chains docs not change 
the types of attractors and the asymptotic dependence 
of the mean number and length of these attractors on 
the component size. Therefore, we restrict ourselves to 
considering all possible different interconnections of rele- 
vant nodes with two relevant inputs with relevant nodes 
with two relevant outputs, i.e., the different topologies, 
irrespective of the lengths of the chains between them. 

We first consider complex components with one node 
with two relevant inputs, that is the case m = 1. The 
two different possibilities to construct these components 
are the ones studied in [18] and schematically shown in 
figure 5. In our simulations, the relative probability of 
obtaining two simple loops with a chain of nodes between 
them in a relevant network is 2/3, and the relative prob- 
ability of obtaining a simple loop with an extra link is 
1/3. For small values of m, these relative frequencies of 
the different possible complex components can easily be 
determined exactly. One has simply to count the number 
of different ways to connect the nodes with two relevant 
inputs or outputs out of the total number (3m)! leading 
to the considered component. For the case of figure 5, 
the number of ways to construct the two components are 
4 and 2 respectively. 



The same counting of ways to construct complex com- 
ponents for m = 2 leads to the table presented in figure 6. 
To get the corresponding relative frequency of occurrence 
one has to divide the multiplicity in figure 6 by 6!. The 
first three rows in the figure correspond to distributing 
nodes with two relevant inputs or outputs over more than 
one relevant component, compare discussion in section 2. 
The dynamical behavior of the components in figure 6 
can be studied in details similarly to [18], if required. It 
is by analogy clear, though, that one would get exponen- 
tially many exponentially long attractors with increasing 
system size. 

5. CONCLUSIONS 

In this paper, we have investigated the properties 
of relevant components of critical random Boolean net- 
works. We used an efficient numerical method based on 
[13] to create a sufficiently large ensemble of the relevant 
parts of sufficiently large networks in order to evaluate 
the number of relevant components, the number of rele- 
vant nodes with two relevant inputs, and the number of 
components with a given number of nodes with two rel- 
evant inputs. The results are in agreement with theoret- 
ical predictions made in [13], and they are supplemented 
by additional analytical results. 

Our main findings are the following: 

• The number of relevant components increases log- 
arithmically with the system size, and usually only 
the largest relevant component is complex, i.e., is 
not a simple loop. 

• For large network sizes, the number of relevant 
nodes with two relevant inputs and the relative fre- 
quencies of different types of complex components 
become independent of the network size. 

• The relative frequency of topologically different 
complex components with the same number of 
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nodes with two relevant inputs can be obtained 
from simple combinatorial considerations. 

• At constant network size, when the mean number 
of nonfrozen nodes with two relevant inputs grows 
(that is, for larger k = 1 + 7 / '/?), the number of 
relevant components decreases roughly by the mean 
value of the logarithm of the number of relevant 
nodes with two relevant inputs, while the size of 
the largest relevant component and the number of 
nodes with two relevant inputs in this component 
becomes larger. 

Since the relevant part constitutes only a vanishing 
portion of the network (the fraction 7V~ 2 / 3 of all nodes), 
and since the different relevant components change their 
state independently of each other, the topology of consid- 
ered types of networks is most likely very different from 
the topology of real biological networks, such as genetic 
regulatory networks, where one would expect that the 
majority of nodes is relevant or at least not always frozen, 
and that different parts of the network are not decoupled 
from each other. Furthermore, we have not addressed 
the question of the function of a complex component in 
biological context, which has to be considered together 
with a suitable definition of the environment and together 
with an appropriate choice of update rules. A lot of work 
still needs to be done before real genetic regulatory and 
other biological networks can be modelled. 



FIG. 6: All possible topologically different components with 
m = 2, together with their multiplicities (number of ways to 
construct them). Arcs stand for one link or a chain of nodes 
with one relevant input. Each crossing of arcs marks a node 
with two relevant inputs or outputs. Filled points represent 
exactly one node. Arrows depict the inputs of nodes with two 
relevant inputs, they appear only where they are important 
to distinguish the complex component. 
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